Mel spectrogram-based audio forgery detection using CNN

نویسندگان

چکیده

In this time of technology, digital speech can be created and falsified by a very diverse hardware software technologies. Audio copy-move forgery is an audio technique that goals to create forged hiding undesirable words or repeating wanted in identical speech. Therefore, authentication has been necessary requisition. study, effective approach spectral images based on detection using convolutional neural networks (CNN) with data augmentation proposed. There are only few handcrafted methods conducted for the forgery. None existing works proposed deep feature learning from recording Mel spectrogram. This first method employ spectrogram The CNN architecture classifies suspicious into two classes: original forged. system successfully trained these image extraction. algorithm tested our datasets generated Arabic Speech Corpus TIMIT database. results show effectiveness, robustness post-processing operations, high accuracy compared other studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Boundary-based Image Forgery Detection by Fast Shallow CNN

Image forgery detection is the task of detecting and localizing forged parts in tampered images. Previous works mostly focus on high resolution images using traces of resampling features, demosaicing features or sharpness of edges. However, a good detection method should also be applicable to low resolution images because compressed or resized images are common these days. To this end, we propo...

متن کامل

A Shift-Recompression-Based Approach to AAC Audio Forgery Detection

Advanced Audio Coding (AAC), a standardized lossy compression scheme for digital audio, which was designed to be the successor of the MP3 format, generally achieves better sound quality than MP3 at similar bit rates. While AAC is also the default or standard audio format for many devices and AAC audio files may be presented as important digital evidences, the authentication of the audio files i...

متن کامل

Raw Waveform-based Audio Classification Using Sample-level CNN Architectures

Music, speech, and acoustic scene sound are often handled separately in the audio domain because of their different signal characteristics. However, as the image domain grows rapidly by versatile image classification models, it is necessary to study extensible classification models in the audio domain as well. In this study, we approach this problem using two types of sample-level deep convolut...

متن کامل

Vision Based Image Forgery Detection

Abstract: Nowadays, digital images can be easily modified by using high-performance computers, sophisticated photo-editing, computer graphics software, etc. These modifications will affect the authenticity of images, from law, politics, the media, and business. Detecting forgery in digital images is one of the major research activities in the current time. In general, altering the digital image...

متن کامل

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack models a very resourceful adversary which may not always be the case. To show insecurity of these ciphers, we present a cipher-text only attack on speech permutation ciphers. We show inherent redundancies of speech can pave the path for a successful cipher-text only attack. To that end, regularities ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Signal, Image and Video Processing

سال: 2022

ISSN: ['1863-1711', '1863-1703']

DOI: https://doi.org/10.1007/s11760-022-02436-4